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Abstract 

We consider the problem of parameter estimation by observations 
of inhomogeneous Poisson process. It is well-known that if the regular- 
ity conditions are fulfilled then the maximum likelihood and bayesian 
estimators are consistent, asymptotically normal and asymptotically 
efficient. These regularity conditions can be roughly presented as fol- 
lows: a) the intensity function of observed process belongs to known 
parametric family of functions, b) the model is identifiable, c) the 
Fisher information is positive continuous function, d) the intensity 
function is sufficiently smooth with respect to the unknown param- 
eter, e) this parameter is an interior point of the interval. We are 
interested in the properties of estimators when these regularity con- 
ditions are not fulfilled. More precisely, we preset a review of the 
results which correspond to the rejection of these conditions one by 
one and we show how the properties of the MLE and Bayesian esti- 
mators change. The proofs of these results are essentially based on 
some general results by Ibragimov and Khasminskii. 
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1 Introduction 

We start with the classical model of i.i.d. observations. Let Xi,--- ,Xn 
be independent and identically distributed random variables with the den- 
sity function /* (x). We suppose that /* (x) = /("i?, x), where /(■,■) is a 
known function depending on the unknown parameter "i? G = {a, (3). 
We have to estimate {} and to describe the properties of estimators in the 
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asymptotic of large samples (n — > oo). We discuss below two estimators: 
maximum likelihood and bayesian. Let us introduce the likelihood func- 
tion L„('j?, X") = YYj=if{'^:^j)- Then the maximum likelihood estimator 
(MLE) -dn and bayesian estimator (BE) (for quadratic loss function and 
density a priori p (•)) are defined by the equations 

It is well-known that if the conditions of regularity are fulfilled then these 
estimators are consistent, asymptotically normal 

(4 - ^ (0, 1 i^y') , ^(^^^-^^^M (0, 1 (i?)-^) , 

and asymptotically efficient. Here !{'&) — J ^j^f^ d/i (x) is the Fisher infor- 
mation. The proofs you can find in any book on asymptotical statistics, e.g., 
Ibragimov and Khasminski (1981). 

These regularity conditions can be roughly described as follows 

• The density /* (x) of the observed r.v. 's belongs to the parametric fam- 
ily, i.e., there exists a value '&o E — {a, (3) such that /* {x) — f (■j?o, x). 

• The function f [i),x) is one or more times differentiable w.r.t. "d with 
certain majoration of the derivatives. 

• The Fisher information I (■(?) is positive function. 

• The Fisher information I (-d) is continuous function. 

• The model is identifiable: if -di ^ ■§2 then / (i?i, x) 7^ / (■j?2, x). 

• The true value {}q is an interior point of the set Q, i.e., 1)0 ^ a and 

• We can observe all values the random variables Xj. 

• The statistical model is fixed and can not be chosen in some optimal 
way. 

Of course, this list is not exhaustive and the other conditions can be men- 
tioned too. We are interested by the properties of estimators, when the 
similar regularity conditions are not fulfilled for some models of continuous 
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time stochastic processes. More precisely, we replace these regularity condi- 
tions by other conditions and study the properties of estimators under these 
new conditions. This approach allows to understand better the role of each 
regularity condition in the properties of estimators. As the model of observa- 
tions in this work we take inhomogeneous Poisson process. The similar work 
concerning parameter estimation for ergodic diffusion processes was already 
published (see [7]), but it seems that the more detailed exposition of the 
proofs will be useful and it is given here. 



2 Regular case 

We observe n independent trajectories = (Xi,...,X„), where Xj = 
{Xj (t) , < t < r}, of a Poisson process = {X (t) ,0 <t < t} of intensity 
function A=k = {A* (t) ,0 <t < r}, i.e., X (0) = 0, the increments on disjoint 
intervals are independent and 

A (i f /■* 
P{X(t)=A;} = -^exp{-A(t)}, A (t) = j K (s) ds. 

The Poisson process sometimes is defined as a series of events < ti < t2 < 
. . . < Im < T and X (t) , < t < r is the corresponding counting process, 
i.e., X it) is equal to the number of events observed up to time t. The process 
X"^ is cadlag (right continuous with left limits at every point t). 

The same model of observation we obtain in the case of r-periodic Poisson 
process X-^" = {X (s) , < s < T„}, if the intensity function A* (s) is r- 
periodic and T„ = rn. Then we can cut the trajectory X"^" on n pieces 
Xj it) = X (t + (j - 1) r) - X ((j - 1) r) , < t < r with j = 1, . . . , n. If 
we suppose that the period r is known (does not depend on t?). As the 
increments of the Poisson process are independent, this model coincides with 
the mentioned above one. 

The Statistician can suppose that this intensity function belongs to some 
parametric class of functions, i.e., A^, = A,?, where A,? = {A (t?, t) , < t < r} 
with 7? G = (a, (3). Therefore he (or she) obtains the problem of estimation 
of the parameter d by the observations X" of the Poisson process of intensity 
function X^,'d G 6. 

We suppose that the intensity is bounded positive function and hence the 
likelihood ratio function for this parametric family is 

ln\{'d,t) dXj{t) -n J [A (^9, t) - 1] dt I (2) 
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and the MLE -^n and BE for quadratic loss function and prior density 
p{9) ,9 E Q (positive, continuous on 9) are defined by the same equations 

Regularity Conditions: 

1. There exists e Q such that (t) = X {^o,t) ,0 < t < r. 



2. The function ^JX [d, t), < t < r has two continuous bounded deriva- 
tives with respect to -d. 

3. The Fisher information 

0<I(^9)= r477rTdt<oo. 

4. The Fisher information I (-i?) is continuous function. 

5. The condition of identifiabihty is fulfilled: for any u > 

-1 2 



inf 







^/MAt)-VxiM 



dt > 0. 



6. The parameter 'd^ is an interior point of the set = (a, (3). 

7. The process Xj (t) is observed on the whole interval [0, r]. 

8. The model of observed process is fixed, i.e., in the statement of the 
problem the intensity function X^ is given (can not be chosen by the 
statistician). 

Of course, Condition 2 implies 4, but we present both of them, because we 
consider below the case, when 4 is not fulfilled. The properties of estimators 
are described in the following theorem. 

Theorem 1 Let the conditions of regularity be fulfilled, then the MLE dn 
and the BE dn are consistent, asymptotically normal 

v/S {L - *„) ^ (o. ^) . V-n (tf. - h) ^ N (o. j^) 

asymptotically efficient and the moments of these estimators converge too. 
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The proof can be found in Theorems 2.4 and 2.5. 

This proof is essentialy based on the general results obtained by Ibragimov 
and Khasminskii [5] , which we present below in a bit more general situation, 
than we need for this theorem. Let us denote by Z„ (u) the normalized 
likelihood ratio process 



ueUn 



where — > and the rate of this convergence is such that Z„ (u) has some 
non degenerate limit (in distribution) Z{u). Below we suppose that in the 
bayesian case {d is a random variable) the loss function is quadratic and 
the density a priory p (J}) ,{} G (a,/5) is continuous positive function. Let us 
define the random variables u and u by the relations 

f^uZ (m) du 



Z (u) = sup Z (u) , 



u 



j^uZ {u) du 



(3) 



The study of the likelihood ratio 2'„ (■) allows to describe the properties of 
estimators (maximum likelihood and bayesian) and this is illustrated by the 
following theorem. 

Theorem 2 (Ibragimov, Khasminskii) Suppose that the following conditions 
are fulfilled 

1. There exist constants a > 1, 5 > 0, such that for all u G U„ 

¥.^e\Zl'\u^) - Zll\u,)f < B \u, - u,\'' 

2. There exist constants /t > and 7 > such that for all n G U„ 

3. The marginal distributions 

{Zn (Mi) ,...,Zn (Uk)) =^ [Z (mi) ,...,Z (Wfc)) 

and Z (■) attains with probability 1 its maximal value at a unique point 
u. 

Then, the MLE dn and BE dn are consistent, 
and for any p > 



(4) 
(5) 



U, 



E,9 \u\ 



E, 



E^ \u 



r,|P 
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For the proof (essentially more general results) see [S], Theorems 3.1.1 and 
3.2.1. Note that in the case of bayesian estimators it is sufficient that the 
parameter a > 0. 

In the regular case of the Theorem [T] the sequence = n~^^'^ and the 
limit process is 

Z (u) = exp |m C (^o) - y I (^o) | , n G 7^, 

where C (^^o) ~ AT (0, 1 (?9o))- Hence 

^^^'^ - Ar(0,l(^o)-^) 



u 



To check the conditions (jlj) and ([5]) in the case of inhomogeneous Poisson 
processes we use the following estimates (below is the true value and 



7l/2 



11/2 



2 — 2E^^ 



Zn {U2) 
_Zn iUi) 



1/2 



2-2exp<!-- 



n 



dt 



< n 



(6) 



and (t?^ = ^0 + ^) 
= ( exp 



= exp 



irinM^4dX(t)-l 
2 J, X{^o,t) 2 



[X (^„,t)-A(^o,t)]dt 



dt L 



(7) 



The regularity conditions allow to obtain the low and upper estimates 
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which provide immediately (jl]) and i^. Using the direct expansion of the 
functions 

V \/n J Jn V 



and In A ^^^o ^ '^I'^j obtain the following representation of the likelihood 
ratio 

Z„ {u) = exp |nA„ (^o, X") - y I (^^o) + r„| , 



where 



and Tn —>■ 0. This representation provides the convergence of the marginal 
distributions of the process (■) to the marginal distributions of the pro- 
cess Z{-). Therefore all conditions of the Theorem [2] are fulfilled and the 
MLE and BE are consistent, asymptotically normal. Let us remind how the 
weak convergence of the likelihood ratio process provides these properties of 
estimators. 

Suppose that we already have the weak convergence of the stochastic 
processes 

Zn{-)^Z{.) (9) 

in the space of continuous on TZ functions vanishing in infinity. Then ac- 
cording to [3] the asymptotic normality of the MLE can be obtained by the 
following way. 

P (dn - ^o) < x] = P \ sup L X") > SUp L X" 

\^{e-'do)<xJ^\yo^^ j y^{e-'do)>x ^ yyo-, ^ 



= P < sup Zn (u) > sup Zn (u) > > P < SUp Z (u) > SUp Z (u) 

\^u<x u>x ) \^u<x u>x 

where we put d = + u/ ^Jn. 
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For the BE we change the variable 6 = -^q + uj \fn = 



jyp{e)L{9,X-) d9 



^9o + 



1 J^^upi'du)L{i^u,X^)du 



p (9) L {9, X") " ■ /u„ P i^u) L {i^u, du ' 

Then using the convergence p {-^u) ~^ P {'>^o) (according to [5J) we can write 



1?0 



p . Iu„Up{l»u) Znju) du ^ ^ 
L p Zn (u) du 



ff,uZ (u) du 



< X 



fj,Z{u) du ^"'j ^ VI (^0 
because the elementary calculus yield the equality 

uZiu) du = ! u e<(''o)-4i(^o) - ^ ("^o 



R 



R 



Z (u) du. 



R 



Hence 

Moreover, by Theorem [2] 



^^ 



{nl (t9o))^E, 



1(^0 

E|Cr, (t2I(^o))'e^„ 



Eicr 



where C ~ A/" (0, 1) 



3 Misspecified model 

Suppose now that the parametric family {A^, ■(9 G 6} does not correspond to 
the observed process X"', i.e., the value "i^o G ©, such that A* = A^^ does 
not exist, but the statistician nevertheless uses this model to estimate the 
parameter i) {no true model case), i.e., he (or she) calculates the likelihood 
ratio function by ([21), where X" are observations of the Poisson process of 
intensity function A^, (■). It can be shown that the MLE and BE converge to 
the value 



arg inf 



A(^,t) 



1 - In 



A (i?, t) 



A* (t) dt, 



(12) 



A, (t) A, (t) 

which minimizes the KuUback-Liebler distance between the measure P,,, 
which corresponds to the observed process with intensity A* and the para- 
metric family {P^,?? G 6}. Note that if A, (t) = A {^o,t) , < t < r, then 
^* = ^0, i-e., the both estimators are consistent. 
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Moreover if -(9^, is an interior point of the set 0, then these estimators are 
asymptotically normal: 



{^n -^?j^Af (0, Dl) , - =^ Af (0, Dl) . 

Here Dl = dll-^ with 



dt. 



Note that in this case the pseudo-LR function Z„ (u) constructed on the base 
of the wrong parametric model has a different limit 



L{^,+u/,/E,X^) 



u 



2 



Zn{u) = ^^^^-^^^ ^ =exp|<. - -I. 

where ~ 7V(0, (i^). The details of this proof can be found in [6]. See as 
well Yoshida and Hayashi [U]. 

We are interested here by a different problem. The intensity of observed 
process A* (t) can be written as contaminated version of the parametric model 
A* {t) = A {^0, t) + h (t) , < t < T, where h (■) (contamination) is unknown 
function. Hence i)^, = i)^, (h) is the point of the minimum of the KuUback- 
Leibler distance ffT^ . We can put the following question: 

when i?,,, = t^o ? 

i.e., when nevertheless the MLE and BE are consistent? 

We consider two situations. The first one (smooth), when the support 
A C [0, r] of the function h (■) is known and A"^ = [0, r] \ A 7^ 0. We can 
modify the likelihood ratio and write it as 

In L X'^) = V r In A t) l^teA^^dX^ {t) - n f [\ t) - 1] l{teA^}dt, 
Jo Jo 

i.e., we exclude the observations on A and define the MLE -dn and BE -dn with 
the help of this function (we call them pseudo-MLE and pseudo-BE). Then 
we have to check if the set of intensity functions {A ('(9, t) ,t e A'^,^9 G 0} 
satisfies the correspondingly modified regularity conditions. For example, 
the Fisher information 
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and the condition of identifiability : for any u > 



inf 



dt>0 



If these conditions are fulfilled, then the estimators -^n and converge to 
the true value (are consistent) and are asymptotically normal. 

Discontinuous intensity functions. Suppose that intensity of the 
observed process is 

A* (t) = [gi (t) + h (t)] l|t<^„| + [g2 (t) + h2 (t)] l{t>^,}, 

where gi{-) < (72 (■) are known positive functions and the functions /ii (■) , 
/i2 (■) are unknown. We have to estimate the time "i^o of switching of in- 
tensity function (change point estimation problem). The MLE and BE are 
constructed on the base of the model with 



A {'d, t) = gi (t) l{t<^} + g2 (t) l{t>tf}, 



< t < r, 



with the likelihood ratio function ([2]), i.e. as if hi (t) = 0, but the observations 
used in contain, of course, hi{-). The Kullback-Leibler distance fll2p 
for {}<{}() is 



Jkl (^) 



91 (t) 



gi (t) + h (t) 

'° r g, (t) 



In 



91 (t) 



+ 



1?0 



g2 (t) + h (t) 

92 jt) 
92 (t) + h (t) 



1 - In 



- 1 - In 



gi (t) + h (t) 
g2 (t) 



gi {t) + h (t) 
92 (t) 



92 (t) + h2 (t) 



[gi (t) + h (t)] dt 
[g, {t) + h, {t)] dt 

[g2 {t) + h2 [t)] dt 



and the similar expression we have for > 'dQ. It is easy to see that if the 
functions hi{-) satisfy the following condition 

< g^ (t) + h (t) < ~J' < g2 (t) + (t) , (13) 



In^ 



then 



dJKL m 



d^ 



< 0, and 



dJKL m 



1?<1?0 



d^9 



> 0. 



Hence the minimum of this function is reached at the point "t^^: = and 
this provides the consistency of the estimators and If we denote 
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X = g2 (t) / Qi (t), hi = hi (t) / gi (t), then we obtain the following regions of 
consistency for hi 



hi < 



X 



- 1 



h2 > 



X — 1 
Inx 



Inx 



- 1 



— X. 



It is important to note that the values of hi can be sufficiently large. 

It can be shown that the rate of convergence is essentially better than 



variable (see similar results in Dabye and Kutoyants [3] , and Dabye, Farinetto 
and Kutoyants [2]). 

4 Non identifiable model 

Suppose that we have the same model for the different values of the param- 
eter, i.e., A t) = X t) ,1 = 2, . . . , k, where 'di ^ 'di^l ^ i and 'di, 'di E Q 
(too many true models). It is well-known that the MLE converges to the set 
{di, . . . , t^fc} of all true values. 

Let us introduce the Gaussian vector C, = {Ci, . . . Xk) with zero mean and 
CO variance matrix q = (qu) 




converges in distribution to some random 




1/2 TM^MM^M) 

Jo ^i^^,t) 



dt 



where the Fisher informations 





Define two random variables: discrete and continuous ^ = Ym=i -'-{bI;} 
and ^ = Yl'i=i Qh where (we suppose that P {|0| = 101} = 0) 




It can be shown that the MLE and BE have the following limits: 



Moreover 
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where On, On are close to random variables and 



1=1 

The proof is based on the weak convergence of the vector of processes 

Z„ (u) = im) , . . . , , Z« (ui) = ^^^^^^xT) 

to the limit process Z (u) = (Z*^^) [ui] , . . . , Z^'^'^ (uk)), where 



l = l,...,k 



(see details in [6J, Section 4.2). 

Example. Let i} G (0, 3) and the intensity function 

A t) = {^^ - 3^9^ + 27?) t + (2^9 - 3) + 1, < t < 1 

then A (1, t) = t'^ + 1 and A (2, t) = + 1. Hence we have 
^ ^ = l{|Cil>IC2|} + 2{|Ci|<|C2|} and so on. 



5 Null Fisher information 

Suppose that I (i9o) = 0. This means that at one point (true value) the 
function A {^o,t) = for all t G [0,r]. Moreover, suppose that the function 
A (t?, t) is 4 times continuously different iable w.r.t. {} with A {i^o,t) = and 

M^.)-r d.>o. 

7o (3!)' \({lo,t) 
Introduce random variable ( {^o) ~ A/'(0, 13 {^0)). Then we have: 



,I3 (^0; 

The proof is based on the weak convergence 



Zn (u) = L^^^'x-^ ^Z{u) = exp (^0) - y I3 (^0) 



u 



6 
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We have to check the conditions of the Theorem [21 Particularly the estimates 
([H]) are replaced by the estimates 

C\d2-H^< I y\ (^2, t) - t)] \t<C\d2- ^1 f 

Jo 

The limit expression for the bayesian estimator is more complicated. 
Example. Let 

\{'d,t) =^sm'^ (m) + 2, 0<t<l, i9e(-l,l) 

then 1; (0) = 0, / = 1, 2 and I3 (0) = ^. Hence 

n'/' (4 - 0) ^ (10)^/^ C'/', C ~ (0, 1) . 

6 Discontinuous Fisher information 

Suppose that the function A t) has at the point -(^o two different derivatives 
from the left A {"^q ,t) and from the right A [^q, tj such that I (^^q ) 7^ I {'&o) 
and all the other conditions of regularity are fulfilled. Then the MLE is 
consistent, but it is no more asymptotically normal. Let us introduce a 
Gaussian vector ( = with mean zero, E^^ = E^^ = 1 and the 

covariance 

Then with the help of the Theorem [2] it can be shown that the MLE is 
consistent, and y/n (^dn — ^oj =^ C but its limit distribution is a mixture of 
three random variables: 

if C- < 0, C+ < or C- < 0, C+ > and |C- 1 > |C+I 

ifC->o,c+<o, 

if C- > 0,C+ > or C- < 0,C+ > and |C-| < |C+| 

These properties follow from the form of the limit likelihood ratio process 

exp{t.C-l(^o)'^'-TlK-)|, «<0 
exp{«C+I«) ^ u>0 



C- 


K 




0, 






c+ 


If 
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We see that there is an atom at the point 0. This form of the hmit hkehhood 
ratio Z {■) provides as well the limit distribution of the bayesian estimates 



L uZ (u) du 



In ^ (") ' 
Example. Suppose that G (0, 2) and 

A i'd, t) = {'d- 1) [3t l|^<i| + 5t^ l|,>i|] +15, < t < 1, 

then I (1—) = I and I (1+) = | and the MLE has the mentioned above limit 
distribution. 



7 Border of the parameter set 

If the true value -do is on the border of the parameter set = [a,/?], say, 
■t^o = a, then the MLE is consistent, but 

n(dn-a)^^ l|C>o}, C (a) ~ AT (0, 1 (a)) . 

Of course, here 1(a) = I(a^). The estimator is asymptotically half-normal 
with an atom at 0, i.e., with probability 0,5 it takes the value 0. This follows 
from the form of the limit likelihood ratio: 

f «^ 1 

Z (u) = exp <u( (a) — —I (a) > , u > 0. 

For the BE we have the limit 



rf~^ /o°° (^) 

\ / „ Z u]du 



where C - AT (0,1). 

To prove these results we have to check the conditions of the Theorem [2] 
for the likelihood ratio process 

Zn{u)= l^^^ny «eU„= [0,/?v^ 
with the corresponding limit process. 
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8 Cusp type singularity 

Let us suppose that the observed process has intensity function 

X{^,t) = alt-^l"" + Xo, 0<t<T 

where k G (O, |). Then this function is not differentiable at one point t = 
and the Fisher information I = 00. To describe the properties of the 
MLE and BE we introduce the normahzed hkehhood ratio process 

Zn in) = ^ , uelJ^= _ ^ _ 

and the hmit process 

=exp|r^W^^(M)-^^r2|, ueTZ. 

Here (■) is double sided fractional Brownian motion, H = k, + ^ (Hurst 
parameter) and 

2 _ 4a^ sin^ {^ttk) B (1 + k, 1 + k) 



Ao cos (ttk) 

where B (1 + k, 1 + k) is beta function. 

We can check the conditions of the Theorem 2 and to show that the MLE 
and BE are consistent, have the following limits 



where the random variables are defined by the same equations and we 
have the corresponding convergence of moments. (For the proof see Dachian, 

HI)- 



9 Discontinuous intensity function 

Let us suppose that the observed process X" = (Xi (•))••• {■)), where 
Xj (■) = {Xj (t) , < t < T} has the intensity function \{t + ^), 0<t<T 
and the function A (y) is positive and continuously differentiable everywhere 
except at the point r, that is A (r+) — A (r__) = r 7^ 0. The set = [a, (3) C 
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{t — T,t). The likelihood ratio process has discontinuous realizations 
and the MLE is defined now by the following relation 



max 



L f^„+, X") , L fi9„-, X") 1 = sup L X") . 

V / V /J i9g0 



The BE is defined as before. 

The limit process Z (u) for the normalized likelihood ratio 

Zn (u) = ^ = {n{a- ^o) ,n{f3 - ^o)) 



IS 



exp I In 7r+ (m) - [A (r+) - A (r_)] m| , 
exp{ln^|^7r_ (-n) - [A (r+) - A(r_)] u| , u < 



Z(u) 



u>0 



where 7r+ (■) and 7r_ (■) are independent Poisson processes of the intensity 
functions A (r_) and A {t^) respectively. Let us denote by u and u the random 
variables defined by the equations 



max [Z , Z (u—)] = sup Z (u) , u 



f.j^uZ (m) du 



i&i ' Z (u) du 

Then the MLE and BE are consistent, have the following limits 



and the convergence of all moments take place. It is shown that for all 
estimators we have a lower bound on the risks and the bayesian estimators 
are asymptotically efficient. For the proof see [H], section 5.1. 



10 Windows. 

Optimal windows. Suppose that we can have observations on some set 
B C [0, r] of Lebesgue measure /i (B) < /i* < r only. The family of such 
sets we denote as JF^, . Our goal is to find the best window B* and estimator 
= '^n (®*) constructed by the observations X" on this set B*, i.e.; Xj = 
{Xj (if:) , t G B*}. The best is understood as the minimizing the mean square 
error asymptotically 

inf inf (4 (B) - ^9)^ ~ {^^ (B*) - ^?)^ 
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If we fix the set B, then we know that the MLE is asymptotically normal 



n (B) -d^^U (0, Ib {^Y^) , Ib (^) 



A (^9, t) 



Therefore if we use the MLE then the best B* = W [d) corresponds to the 
solution of the following equation 

Ib* {^) = sup Ib . 

To solve this equation we introduce the level sets C(^_r) and function {-d, r) 
as 

Then we define r^, = r^, (i?) as solution of the equation fi{'d,r) = fi^,. Now 
we put B* = C(^^r,)- Of course, we are not obliged to use the MLE and 
moreover, this set B* can not be used for construction of estimator because 
it depends on Nevertheless it allows to introduce the lower bound on the 
risks of all couples {set, estimator): for any -do E Q 

lim lim inf sup nE^ (4 (B) - > Ib* {^o)~^ • 

5->0n^oo Key^,,t,,-&n |tf_tfo|<5 

To construct the asymptotically efficient in this sense couple we first some B G 

JF^, and by observations = (B) , j = 1, . . . , -^[^] | 'we (consistently) 

estimate using some estimator i)^. Then we introduce the observation 
window 

Now we construct the MLE dn-^ and show that 

lim lim sup nE^ i^n {K) - = Ib* (?9o)'' ■ 

For the conditions and proofs see Kutoyants and Spokoiny [H] or [B], 
Section 4.3. 

Example. Let A {'d^t) = [6 + 'i^sin (cut)]^ , < t < r, where r = 271 /uj. 
Then the Fisher information is 



Ib (i?) = 4 I [sin {ut)f dt. 
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Introduce 

Cr = {t : 4 sin (cut)^ > , r Z^*) = n = 4 sin^ 
where (/x^, < r). Then ip = arcsin 





T 


_ ^" 


U 


"r 


.UJ 


2 


u;. 


.2 



3r - /i* 3r + yU^, 
4 ' 4 _ ■ 

Therefore the observations in the optimal window are 

X" = (Xi (B*) , . . . , Xi (B*)) , with Xj (B*) = {Xj (t) , t G B*} 
and the asymptotically efficient estimator is the MLE dn (B*). 

Sufficient windows. The analysis of the proofs of the consistency 
of the MLE and BE in the discontinuous case (see, e.g., [H], Chapter 5) 
shows that the main contribution to the likelihood ratio process is made 
by the observations near the jumps. Suppose that the intensity function 
is X{'^,t) = X(t — {}), where ^ G {a, (3) and < a < P < t. Suppose 
as well that the function A (s) , s G (— /5, r — a) is discontinuous at some 
point r^, and continuous on [— /5, r^,) U (r^,, r — a]. Then it is sufficient to 
keep the observations on the interval B = [a + r^, (3 + r^,] only, i.e.; to use 
Xj (B) = {Xj (t) ,a + T^: < t < j3 + T^:}, j = 1, . . . , n and the properties of 
the MLE and BE (consistency, limit distributions and convergence of mo- 
ments) will be the same as in the case of complete observations on [0, r]. 

Moreover, if we have a consistent and asymptotically normal estimator 
•^n of (say, an estimator of the method of moments), then we can use the 
first [y/n\ observations for preliminary estimation by "t}^ of the window as 

= ~ rC^I^ , + , and then to construct the MLE '&n-y^. and 

bayesian estimator ^n-y^- Note that n^^^ {'^y/n — ^ M (0, cr^). Hence 

Therefore we can have consistent and asymptotically efficient estimators con- 
structed by observations in the window of vanishing size. In regular case such 
effect is difficult to wait. 

Example. Let G (a,/5) C (0, r) and 

X{'&,t) =2at + bl{t>i)}, 0<t<T. 



27r 




T — fi* T + fi^ 

4 ' 4 
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Then 



(r) - ar^ 



is consistent and asymptotically normal estimator of i}. Then we maximize 
the function 



1" ^tf^+n-l/S 
V / In A t) dXj (t) 

-(n- [v^) / [A(^,t)-l]dt 



and construct the MLE. Note that the random variable is independent 
on Xj, j = [v^] + 1, . . . ,n. 



11 Rates of convergence. 

It is interesting to note that if we observe a periodic Poisson process X" = 
{X (t) , < t < n} with the intensity functions A (t? + t) or A where A (■) 
is periodic smooth function (phase and frequency modulations in the optical 
telecommunication theory), then we have {n oo) 

respectively. If A (t) is discontinuous function then for the mentioned two 
cases of modulations we have the different rates 

For the proofs see [6]. 

Therefore it is natural to put the following question: what is the maximal 
possible rate of convergence of the mean square error to zero? Suppose that 
we can choose any function \{{},t) ,{} G [0, 1] , t > satisfying the only 
condition 

< A(i9,t) < L,, 

where L^, > is some given constant. We denote the class of such functions 
as T {L^). It can be shown that 

inf inf sup Etf A^n-^? = e-'^^^^°^^^\ 
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i.e., the best rate is exponential. To prove this equahty we need to prove 
two results. The first one is the lower bound for all A (■) G J-' (L^) and all 
estimators {}n 

sup E^,A |4 - > e"'t^(^+°(^)), 

and the second is to construct an intensity function A* (■) G {L^) and an 
estimator -(9* such that 

sup E^,AJC-^l' = e-'^('+°(')). 

i96[0,l] 

For the proof see Burnashev and Kutoyants [T]. 
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